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We study numerically various wave functions in a gauged matrix quantum me- 
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of N up to 10000. This system is a truncated and quenched version of = 4 SYM 
that serves as a minisuperspace approximation to the full = 4 SYM system. This 
setup encodes aspects of the geometry of the AdS dual in terms of joint eigenvalue 
distributions for the matrices in the large N limit. We analyze the problem of de- 
termining geometric measurements from these fluctuating distributions at finite N 
and how fast they approach to the large N limit. We treat this eigenvalue geometry 
information as a proxy for geometric calculations in quantum gravity in a description 
where gravity is an emergent phenomenon. Our results show that care is needed in 
choosing the observables that measure the geometry: different choices of observables 
give different answers, have different size fluctuations at finite N and they converge 
at different rates to the large N limit. We find that some natural choices of observ- 
ables are pathological at finite N for N sufficiently small. Finally, we note that the 
approach to the large N limit does not seem to follow the expected convergence in 
powers of from planar diagram arguments. Our evidence suggests that differ- 

ent powers of N appear, but convergence to large N is rather slow so the values of 
N we have explored might be too small to conclude this unambiguously. 
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I. INTRODUCTION 

The AdS/CFT correspondence has revolutionized how we think about strongly quantum 
field theories and quantum gravity [1]. Because of these discoveries, classical gravitational 
techniques can be used to produce non-trivial statements about strongly coupled field theo- 
ries. On the other hand, the manifest unitarity of the field theory seems to resolve (at least 
in principle), the information paradox related to the formation and evaporation of black 
holes [2]. 

Obviously, if the two sides are truly equivalent, we should ask that the quantum gravi- 
tational theory encode individual quanta and other phenomena that we associate to weakly 
coupled field theories. By the same token, we should be able to answer geometric questions 
posed in the gravity theory by studying the degrees of freedom of the quantum field theory 
directly. The usual AdS/CFT dictionary [3, 4] usually works as a black box: it permits us 
to compare calculations between the AdS and CFT setups, but it does not explain how the 
degrees of freedom of the field theory reorganize themselves as geometry. Nor does it tell 
us how geometry breaks down. What is needed is a theory of emergent geometry: a reason 
why higher dimensional geometry is the right answer for certain classes of questions. Solving 
the AdS/CFT correspondence requires doing this. The full answer to this question is still 
beyond what is currently known. However, a model that seem to capture some of these fea- 
tures is available [5] and can be used to explore the passage from a microscopic description 
of the degrees of freedom to collective degrees of freedom that represent geometry. 

Given a model of emergent geometry, one can then ask for the simplest geometric ques- 
tions, like what is the size of an object or geometric feature. This will be the main purpose 
of the present paper: to understand and explore how one can measure geometric properties 
of the dual spacetime directly in the quantum field theory. We will do this by Monte-Carlo 
simulations of a collection of minisuperspace wave functions in field theory, following the 
ideas presented in [6, 7]. Numerical methods to study gravity problems with a dual quan- 
tum mechanics have been advocated also in [8-14]. In particular these other studies try to 
match the black hole specific heat and perhaps even the radius of the horizon for a black 
hole made of DO branes in ten dimensions. Those simulations use the full N x N matrix 
degrees of freedom in a Euclidean time formulation, so taking large is problematic. Other 
approaches to study A^ = 4 SYM for large A^ numerically use clever truncations to matrix 
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quantum mechanics [15, 16], but they suffer from similar hmits on taking N large. 

The main problem at hand is that in order to compare gravity and the field theory, 
in the regime where gravity is described by a semiclassical expansion, we are required to 
solve a field theory at strong coupling. This is extremely hard in general and this is why 
numerical methods might offer a good insight into the strong coupling behavior of these field 
theories. However, as is well known, the gravitational limit also requires solving the field 
theory at large N. Current lattice methods can in principle solve four dimensional theories 
for moderate values oF ~ 10, but taking N large is too difficult. Remember that in the 
AdS/CFT correspondence the radius of the five sphere and also of the AdS sphere in Planck 
units scales like iV-^/'^ Thus, in order to have a resolution of the sphere which is many times 
the Planck length, let us say by an order of magnitude, we require four orders of magnitude 
on N . Such numbers are beyond what can be done with a full simulation. Clearly, some 
approximation to the full field theory problem is required that reduces effectively the number 
of degrees of freedom to something manageable. Only at this stage simulations become a 
viable option to analyze the problem. 

Progress in this direction was made in [5]. It was argued there that at strong coupling, 
when the J\f = A SYM field theory is compactified on a three-sphere, the configurations of 
commuting (constant) scalar matrices dominate the infrared dynamics. Toy models that 
exhibit this property were exhibited in [17], giving additional indirect evidence for this 
proposal in the case of = 4 SYM. 

The nature of the argument that leads to this conclusion requires various steps. First one 
argues that such classes of configurations are required for describing some supcrsymmetric 
states, so the truncation to such variables might be better behaved than naively expected. 
Afterwards one argues that the quartic scalar potential of the theory should be minimized. 
This second step is accomplished by systems of commuting matrices. The dominance of this 
configurations dynamically quenches the other off-diagonal degrees of freedom, meaning that 
they become heavy and their contributions can be ignored to some degree. 

The end product is a model describing only a handful of the degrees of freedom of the full 
A/" = 4 SYM theory, and since the other degrees of freedom are essentially ignored, we can 
call this setup a quenched minisuperspace approximation. We will use the name quenched 
in the rest of the paper. What has been found is that the distribution of eigenvalues of 
the commuting matrices (in this approximation) has a geometry of its own [5, 18]. The 
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matrix eigenvalues experience some quantum induced repulsion (similar to the VanderMonde 
repulsion of eigenvalues in matrix models [19]), arising from the volume of the gauge orbit 
of the configurations, and a confining potential due to the coupling to the curvature of the 
sphere (this is realized as an effective mass term for the eigenvalues). Repulsion wins at short 
distances, and the confining potential wins at long distances. Because of these competing 
effects one expects an equilibrium at finite radius and shape. 

In the large N thermodynamic limit the eigenvalues can form a geometric distribution 
that depends on the particular wave function of the system that is chosen. For the ground 
state, the wave function is known [5]. We will call this wave function ipo. For this wave 
function the eigenvalue distribution is an sub manifold of M^, where the radius is of order 
\/N in field theory units. Locality in is induced from locality in M^: the masses of 
off-diagonal degrees of freedom depend on the distances between eigenvalues in MP. Modes 
connecting nearby eigenvalues correspond to lighter degrees of freedom, whereas modes 
connecting distant eigenvalues correspond to very heavy degrees of freedom that decouple 
sooner. Hence the distance in is meaningfully describing a notion of locality: how coupled 
the different degrees of freedom are to each other when integrating out heavy fields. 

One can argue this way that this distribution describes the of the AdS^ x dual 
geometry. Other supersymmetric wave functions are obtained by multiplying the ground 
state wave function with holomorphic polynomials of the variables, properly symmetrized 
between the eigenvalues. This type of argument has also been generahzed to orbifolds [20, 21] 
and conifolds [22, 23], giving a large number of qualitative and quantitative tests of this idea. 
However, reproducing the exact geometry of supergravity from the field theory is still beyond 
the current understanding of the AdS/CFT dictionary. A particularly difficult question to 
answer is how different field theory degrees of freedom are encoded and localize in different 
parts of the AdS geometry, how the holographic bound is realized in practice [24] and in 
particular how to relate these questions to the holographic renormalization group [25, 26] 
(for a recent review see [27]). 

The upshot is that some geometric information can be extracted from these wave func- 
tions, but the said geometry contains only partial information about the full ten dimensional 
geometry that supergravity would see. Since to date there is no convincing way to extract 
the ten dimensional geometry directly from the field theory without assuming a correspon- 
dence, we will use the limited geometric information that can be derived from the field 
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theory in these approximations as a proxy for the full dual (quantum) geometry. Naturally, 
one can argue that this is the same problem of comparing apples to oranges and therefore 
the results are of limited value. Although this is technically true, a toy model that performs 
this reorganization of degrees of freedom into some type of geometry is extremely valuable 
because some of the conclusions might be universal enough that they apply even in the 
general case. A case in point is the usage of jV = 4 SYM at finite temperature as a proxy 
for finite temperature QCD, and the use of the gravity dual to SYM to make predictions 
about the viscosity of the quark gluon plasma [28] . 

Contrary to other approaches of quantum gravity that attempt to build a spacetime from 
nothing: like those growing spacetime from a point in the no-boundary proposal for the 
wave function of the universe [29], or the causal set approach to quantum gravity [30]; in this 
setup there is no unique preferred wave function, and different wave functions correspond 
to different geometries. Some wave functions might not correspond to a single geometry 
either: superposition of geometries is possible. These states would be like a Schrodinger cat 
state: a simple measurement would destroy the superposition. However, when we can barely 
understand a single geometry at this stage of the research program, studying this feature of 
superpositions of geometry is not desirable. The setup we arc describing is similar to string 
states as solutions of the string equations of motion in a background spacetime: for each 
vertex operator of dimension (1,1) (this is how we describe quantum states via the operator- 
state correspondence) one can associate a different motion to the string in the semiclassical 
setup, and that motion defines a geometric trajectory for the string. Neither of these motions 
is preferred a priori, except as described by some preparation of an experiment. What we 
are studying here is the same: how different wave functions in our quenched minisupcrspace 
description correspond to different geometries. Our ultimate goal is to establish a tractable 
dictionary between geometries and wave functions in this very limited context and to study 
how the fluctuations inherent to the wave function give fluctuations of geometric data. Our 
aim is not to be exhaustive, but to instead develop some intuition about emergent geometry 
with some simple wave functions that are easy to evaluate numerically. 

It has also been found that by changing the wave function, the geometry of the field theory 
degrees of freedom admits different topologies in the thermodynamic limit. Numerically this 
has been observed in [6] . This is also what is expected from the classification of half BPS 
states in the dual supergravity theory [31] , where topology changing transitions are allowed. 
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The simplest wave functions which do this are powers of determinants of one of the complex 
matrices Z (these are wave functions describing many giant gravitons on top of each other, at 
weak coupling they are described by similar formulae involving operators of the full matrices 
[32, 33]. At strong coupling, the giant gravitons condense into a topology change once the 
gravitational backreaction is taken into account). The matrix Z is a diagonal matrix. The 
eigenvalues represent one of the three complex coordinates determining ~ as a complex 
manifold. Such wave functions are of the form 

i,Q = det(Z)QVo (1) 

where Z + + iX^ for example, and Q is an arbitrary integer. 

When is finite, the system (with the wave function) should be considered as a version 
of quantum geometry at finite values of h. The quantity XjN plays such a role in the ^4^5' 
geometry. After all A"^/^ measures the size of AdS in Planck units. Thus, finite A" effects 
and how fiuctuations depend on A^ can be said to encode quantum gravity corrections. 

The idea of this paper is to gain some understanding of emergent quantum geometry. 
Since we do not have direct access to the ten dimensional geometry from current methods, 
we will use the quenched field theory geometry to explore the problem of convergence to 
large A" and the problem of determining the shape of the geometry at finite A". A similar 
problem has been studied in [34, 35] for the case of fuzzy geometries of brane embeddings 
(where the brane configurations are non-abelian) . In our case the notion of shape needs to 
be refined with respect to the thermodynamic limit. What we call geometry according to 
the prescription [5, 18] is characterized by the (support of the) large A^ density of eigenvalue 
distributions, properly rescaled by the size of the ground state, \fN . This can be shown to 
be a singular distribution of in the thermodynamic limit, namely the eigenvalues form a 
manifold of lower dimension, for some general class of BPS wave functions [5, 23]. However, 
this is just the saddle point description at large A" of the wave function. This procedure 
does not take into account that the eigenvalues are discrete, or that there are fiuctuations 
around this saddle point. 

At finite A^ we expect that there are corrections to this geometric saddle point approx- 
imation of the wave function. These quantum corrections will be due both to granularity 
of the constituents (the system has finitely many eigenvalues in R^), as well as due to the 
quantum spread of the wave function (so that the support of the wave function is not sin- 
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gular). The first one is non-perturbative in nature, as it provides a cutoff on the number 
of independent degrees of freedom characterizing the geometry. It also suggests that the 
system might have some 'granular' features. The second class of corrections can be used to 

represent the ordinary quantum fluctuations of the geometry. 

Given a wave function for these degrees of freedom, we would ask the following simple 
questions: 

1. What is the topology of the distribution? 

2. What are the geometric measurements of the holes of the geometry? 

3. How big arc the quantum fluctuations? 

Notice that when we consider N finite, the notion of topology and shape becomes fuzzy, 
due to both the fluctuations of the eigenvalues, as well as because of the discreteness of the 
eigenvalues themselves. First, the eigenvalues have some probability to be in any distribu- 
tion, except on sets of measure zero. Thus the support of the distribution is not a very 
useful notion of geometry, but maybe it is enough to describe the topology. However, if one 
is trying to find sets of measure zero numerically this might not be easy to extract from a 
simulation in the general case. Secondly, measurements on fluctuating object shapes will 
have some inherent randomness and there will be errors in measurements. If the particles 
can be in any spatial distribution, there are no minimal distances to define sizes either. Thus 
how one chooses to sample the distribution to define a notion of size becomes important as 
well. 

In such a situation, wc have to change the questions to something that makes sense 
computationally. We need observables such that in the thermodynamic limit (A^ oo) they 
recover the classical measurement problem with probability arbitrarily close to one at finite 
sampling (taking a single measurement on a typical configuration, within some prescribed 
error bars). Some observables are better suited to this task than others. Moreover, different 
observables will differ at finite by 1/N corrections (sec for example the discussion in 
[36] for fuzzy sphere membranes). So the real question we need to ask is what are the 
optimal choices of observables that characterize geometric measurements and how fast do 
they approach the thermodynamic large N limit? 

As we said above, we will approach this problem by doing numerical simulations. The 
basic setup for these simulations was described in [21] and [7]. The Monte-Carlo model we 
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use simulates the probability distribution of eigenvalues from the square of the quenched 
wave function (including the measure effects). This is very different in spirit than lattice 
approaches. It also is free from sign or phase problems. Also, because our matrices are 
diagonal, we have only 6A^ degrees of freedom, so N can be made large at moderate cost. 
Using the same Monte-Carlo code, we will address the questions above for a particularly 
simple class of wave functions, namely those described by equation 1. For these wave 
functions the geometry of the eigenvalues looks like a five-sphere with a hole in the center, 
so we are not studying the process of topology changes. We are fixing the topology at the 
beginning. The issue we will study seems very mundane: how do we measure the radius 
of the hole? Since these arc wave functions of fixed topology, we will not address the first 
question above at all: the topology is known so we do not need an algorithm to figure it out. 
We also have chosen the orientation of the feature on the sphere and do not have to orient 
the data to analyze the geometry. Given this, we can see by eye the topology on these simple 
situations by projecting the particles positions on the Z plane. The more general problem of 
also determining the topology for a random configuration with a complicated wave function 
would be hard: we would need to find a way to do pattern recognition on these distributions 
to define topology. Regarding measurement of size, we propose various definitions of the 
size of the simplest geometric features and we study their virtues and failings. In particular, 
to simplify matters further, we want to characterize the simplest such non-trivial feature: 
the radius of the hole compared to the radius of the sphere. In the end, from the different 
choices of definitions of the radius at finite N, one such class of measurements seems to give 
an optimal solution to the problem. For this particular class of observables we can then 
address the corrections in more detail. The definitions we use are simple to describe, 
but that does not mean that they are optimal. We can only say that they are optimal only 
within the choices we have. 

The paper is organized as follows. In section II we describe some basic aspects of the wave 
functions and statistical distributions that we simulate later on. In particular, we show how 
the thermodynamic limit should be taken: — >■ oo, keeping q = Q/N fixed. We also pay 
special attention to how a factor of N"^ appears in front of the energy for the thermodynamic 
limit after appropriate rescalings. We also describe various functions that allow us to define 
the size of the features of the geometry of the eigenvalue distributions. The definition of 
the size depends on taking a limit A; — > oo of such measurements, so that one expects a well 
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behaved 1/N expansion at fixed k. We also review the argument that leads to a singular 
distribution of eigenvalues in the saddle point approximations. In section III we describe 
our numerical results for the wave functions. We describe data sets in various types of 
analyses. First we describe individual data sets for fixed Q, A^. This lets us sec the topology 
of the configuration from the plots and we also see that the support of the distribution 
of eigenvalues if of codimension one at large A^, with fluctuations. We can also compare 
the different characterizations of the radius of the hole to see how big the fluctuations are 
on the various measurements and why some statistical characterizations of this feature are 
pathological (they are dominated by fluctuations). For the measurements that are well 
behaved we find that they converge at different rates, that one can mathematically prove 
some inequalities between them and that one such type of measurement converges faster and 
is optimal given our choices. Given this information, we then study the problem of varying 
N at fixed q — Q/N, to see how the large N limit is approached. Our numerical results 
show that the convergence to large N is slower than would be expected based on large N 
counting arguments. Finally, we also study the radius of the hole by varying q at fixed large 
N (N = 10000) and obtain a scaling exponent of the radius as a function of q for small q. 
These results are different than those that were found previously in [6], that were obtained 
for much smaller values of N. We then conclude. 

II. WAVE FUNCTIONS AND GEOMETRIC OBSERVABLES 

The matrix model of commuting hermitian matrices X" [5] leads (by diagonalization of 
the matrices) to a model of N particles in six dimensions whose quantum Hamiltonian is 
given by 



where the positions of the particles in and /^^ = Yl^^^ \ 

Ob 2 Ob j I IS 9j generalized 

Vandermonde determinant. The are the i-th eigenvalue of the matrix X"'. The six 
X variables per eigenvalue can be written in complex notation in terms of three complex 
numbers ^/2z"' — x'^°'~^ + ix^", which can be arranges as a vector z and their complex 
conjugates z per each eigenvalue. 




(2) 
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The ground state wave function is given by [5] 

ZiZi) (3) 



V'o(a;) = exp ^-^^ =exp(-^, 



The associated probabihty density for the particle positions is then given by 

P = l^olV = exp I - + J^log|fj - Xj\^ J (4) 

\ i i<j / 

It is also easy to show also that the wave function above is an eigenfunction of the 
following similar looking Hamiltonian 

i ^ 

with the derivatives being holomorphic and anti-holomorphic in the prescribed order. To 
prove this, one has to use the fact that //^ is a scaling function under rescalings of the z 
variables that leave z fixed (we are treating them as independent variables, rather than 
complex conjugate variables for that). The complete argument, which is beyond the scope 
of the present paper, is very similar to the one appearing in [5, 23]. 

The next set of simplest wave functions in this quenched dynamics are those associated to 
BPS states. They are described by holomorphic wave functions of the z variables multiplying 
the ground state wave function and these are symmetrized over the eigenvalues (this is is a 
discrete gauge symmetry of permutation of eigenvalues) . These are not exact wave functions 
of H, but they are argued to be close to exact wave functions of the system. However, 
it turns out that all the homogeneous holomorphic wave functions are eigcnfunctions of 
H, with energy E — Eq equal to the degree of the polynomial of the z. As such, the 
Hamiltonian H seems more appropriate to capture all BPS states exactly than since the 
holomorphic wave functions do not seem to be eigcnfunctions of except under certain 
approximations in the large N hmit [23] . In general, the set of BPS solutions of the classical 
equations of motion on are in one to one correspondence with the moduli space of vacua 
of supcrconformal field theories. The quantization of these configurations is a particular 
holomorphic quantization of the moduli space of vacua [22] , which requires a measure. The 
Hamiltonian H would give the exact answer for the chiral ring state energies and has the 
correct measure of the ground state wave function in the classical approximation to these 
solutions, which is the same quenched approximation we have been studying. This is the 
main reason to introduce it above. However, nothing else that we do depends on that fact. 
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The simplest wave functions that change the topology are those that involve many (max- 
imal) giant gravitons on top of each other. The giant gravitons in the type JIB gravity 
theory in AdS^ x are extended brane configuration with the same quantum numbers as 
gravitons [37] that wrap an inside S^. If one places many of them on top of each other 
one expects a throat to form in the geometry near the location of the branes and to excise 
the branes from the geometry and replace them by fluxes [31, 38, 39]. The new topological 
feature is associated to the removal of the branes and the capping of the region of spacetime 
where they have been removed. 

In the dual field theory these are described at weak couphng in [32] and can be related to 
a system of free fcrmions on a two-dimensional phase space [33, 40]. This picture, in terms 
of incompressible quantum droplets is recovered geometrically in the half-BPS geometries 
[31], where the exact geometries of the capping are understood for half BPS D- branes with 
arbitrary shapes. These giant graviton states have been extensively studied at weak coupling. 
A recent review of many of these results can be found in [41]. 

Now, just having the map of droplets to states and to gravity does not mean that we 
understand the ten dimensional geometry from field theory. It just gives us a correspondence 
of particular solutions, and our task would be to explain the ten dimensional geometry from 
this data: how did the extra dimensions arise and why is there locality, time bending, etc., 
in the geometries. This is too hard with present techniques, which is why we will look at 
the field theory probability distributions for the quenched wave functions directly. 

The wave functions we consider are given by 

tPq = det{Z'f^o (6) 

where — + iX"^ in terms of the matrices and Q is an integer determining the number 
of giant gravitons. These are also given by z\ coordinates. Notice that Q is an integer in 
order for the wave function to be single valued. The normalization of ip does not matter to 
determine the eigenvalue distribution, after all, wave functions are rays in a Hilbert space. 

The associated probability distribution of the eigenvalues (particles in six dimensions) is 
given by 

p~exp I - ^|f,|2 + g^log(|2;,i|') + ^log|f,-f,f J (7) 

Notice that this probability distribution can be interpreted as a Boltzmann gas of particles 
with two body repulsive logarithmic interactions in 6 dimensions, subject to a background 
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potential. The logarithmic repulsions in six dimensions are long range. Because of that, 
the associated statistical mechanical problem is not standard, li Q — the background 
potential is confining towards the origin. If Q 7^ 0, the background potential is confining 
in generic directions, but it is repulsive from the = locus with a logarithmic repulsion 
(this is, particles are repelled from the locus = x'^ = 0). 

In the gravity theory, we usually take the large N limit to go to the classical limit. In 
this case, we can take the large N limit of the Boltzmann gas which is a thermodynamic 
limit of large number of particles. In the thermodynamic limit we should be able to replace 
individual eigenvalues by a density of eigenvalues on MP, given by p{x). The probability 
density for the eigenvalue distribution is then given by 



p(p) oc exp ^- J (fxp{x) [x^ + Q\og{xl + xl)] + ^ jj (fx(fyp{x)p{y) log \x - 



(8) 



The density is constrained so that p > and J d^xp{x) — N. As shown in [18], the size of 
the distribution for Q = is of order ^/N . So we can rescale the coordinates x = \/Na and 
the density p{a)(fa = p{x)N~^d^x. In this way, we have that the normalized distribution p 
satisfies 

J p{a)d^a = 1 (9) 

If we do this substitution we find that 

p{p) oc exp (^-N^ J d^ap{a) [-cP + glog(a? + al)] +^ JJ d^ad%p{a)p{b) log \d-b\^^ 

(10) 

where q = Q/N. Notice that the extra numerical factors that arc obtained from the rescaling 
of X inside the logarithms give a constant, due to the constraint j p = 1. This can be 
absorbed in the normalization of p. In this new setup, we see that a limit should be smooth 
at large A?" if g is held fixed in the process. Also, the factor of N'^ in front of the exponential 
term shows that when A" — > 00 we should be able to use a saddle point configuration for p 
with confidence. It is also clear that 1/A"^ plays a similar role to h in semiclassical physics, 
where the saddle point equations to a Euclidean path integral involve finding the cxtrcma of 
exp^—S/h). l/N"^ can also be interpreted as a temperature, so large A^ cools the system and 
suppresses fiuctuations. This is exactly what is expected from the match to supergravity [1] 
and to the planar diagram counting [42]. 
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The equations of the saddle point are the following 

C - + qlog{al + 4) + J d%p{b) log \a-b\^ - (11) 

where C is a Lagrange multiplier enforcing the constraint (9). These equations are valid on 
the support of p. We can prove that these equations lead to a singular support for p. We 
do this by contradiction. 

Assume that a smooth locus for the support of p is allowed. In that case, we can differ- 
entiate the equation 11 six times, with the Laplacian in 6 dimensions to the cubed power. 
We find then that 

{V^fqlog{al + al) + J db''p{b)VHog\a -b\'' ^ (12) 

as the first two terms obviously are cancelled. Notice that 

log{al + al)c^\og{z')+log{z') (13) 

so that away from = v/e have that 

log(a} + al) = (log(^^) + \og{z')) = (14) 

because the function is harmonic: a sum of a holomorphic plus an antiholomorphic piece. 
Also, is is easy to show that (V^)^log(a — 6) oc 5^(a — b). With these, we would find that 
p = 0. This argument contradicts that / p = 1, so the assumption that p is smooth does 
not work. This argument has been given before in [5, 23], and we are presenting it here for 
completeness. 

What this indicates is that the density distribution of particles is singular. Since for 
q — the original probability distribution is spherically symmetric, it is natural to assume 
that the associated distribution is spherically symmetric. The radius was computed in [18]. 
This has also been checked numerically in [6, 7]. A generahzation to all Calabi-Yau cone 
conformal field theory singularities for this result can be found in [23], where it was seen that 
this procedure matches various other supergravity calculations in the absence of symmetry. 

For g > but very small, we expect that the distribution does not change much, except in 
the region 2;^ ~ 0. Thus we expect that the distribution gives a five dimensional manifold of 
IR^, possibly with a boundary, and that the region near z^ — has no density of eigenvalues 
present. After all, the effective potential repels all particles from the locus z^ — 0. This 
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indeed was seen numerically in [6] , but the treatment was not systematic. This paper studies 
this issue in detail. 

In this situation, we see from the saddle point equations that the radius of the hole should 
be a function of q in the large limit. However, for finite there are corrections. The 
corrections from fluctuations about the saddle point are of order because A^^ is playing 

the role of H in the saddle point approximation. There are also corrections due to granularity, 
for which there is currently no theoretical understanding as to how they systematically affect 
the correlations. 

The projection of the density of particles in the 12 plane gives us a disk with a hole in it. 
Let us assume that this picture from these simulations is correct geometrically. Therefore 
in the large N limit the saddle point density has no fluctuations (they are suppressed by 
1/N^). Because of this we should be able to define a geometric radius of the inner radius of 
the distribution at — > oo. This should be measurable. 

Now, we need to find a good geometric description of the size of the hole that can be 
evaluated numerically for configurations that are not strictly ai N — oo, but that have 
some fluctuations. This is, we want to find a radius function of the distribution that we can 
average over configurations 



We want the function ro(p, N) to be as well behaved as possible. This is, we want the 
fluctuations of tq to be small. There is more than one such function. Let us consider various 
options. 

The arguments of planar diagrams suggest that good functions are traces over matrices. 
These are given in the thermodynamic limit by integrals over p (equivalently, integrals over 
p). Consider for example the following numbers 



The quantities are natural. They weigh the points in the distribution by how far they are 
from the = locus. For very large k, these are dominates by the smallest value Tq that 
p can attain. These are also easy to compute numerically. This was the rationale for using 
them in our numerical codes and this choice was made before taking the data. We collected 
these for various k for many statistically independent configurations at different values of 
Q, N. We will explain the data later. 



ro{N,q) = {ro{p,N)) 



(15) 




(16) 
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Thus, we have our first definition of the radius Tq. This is given by 





2k\-4r: 



-1 

2k 



(17) 



Notice that this is an improvement over the definition of the radius given in [6] , where the 
average of the closest five points to the = locus were used. That definition does not 
seem to be well behaved at large N. Also, we are averaging a trace, so we can expect that 
the fiuctuations are of order (this is a typical planar counting argument and at this 

stage it is just a theoretical intuition to set the size of errors). Here we have a definition 
that is more democratic in the particles. The (s) superscript is a reminder that this is a 
simple definition. However, this definition runs into other problems that are not immediately 
obvious. This is that the average of the expression (16) can be infinite, giving us A'^^ = 0. 
This is because we are taking a double limit N — )■ oo, Q/N fixed /c — )■ oo, but there can 
be a problem with the order of limits, and indeed there is. Notice that when k > Q the 
distribution for a single eigenvalue has a zero a,t — in \z^\ oi order 2Q. But that the 
trace is given by l/\zl\'^'', which has a pole of order 2 A; in the corresponding variable. If 
k > Q the integral of the probability distribution over the position of the particle diverges. 
However, if we take Q/N fixed, and k < Q, we have a finite answer. Thus the limit radius 
A; — )■ oo would give 0, which is not right. This means that in this case we are not allowed to 
take the limit /c — >■ oo. In this situation, we would say the observable is dominated by large 
fiuctuations. We clearly want to avoid these classes of observables to define the shape. 

The next simplest expression is given by changing the place where we do the averaging 
in the above 



This is, we average a root of a trace, as opposed to take the root of the average of a trace. One 
can check explicitly that the problem with the poles in the trace dominating the expression 
disappears now. We are taking the root before averaging, so near the bad locus the behavior 
of the function is not badly behaved, so the system is not heavily weighed towards these 
configurations. Now we can now take k oo safely. The (r) symbol notation is a reminder 
to indicate that we are taking a root. If we take /c — >■ oo in a finite sum, one can show that 
the sum is dominated by the single eigenvalue closest to the z^ — locus, so that recovers a 
definition close in spirit to the one in [6] . It should be emphasized that we are not required 
to take A; — > oo. Each of these gives a definition of a radius of the configuration, so we can 




(18) 
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study the result for various k and collect them together. We should see convergence to a 
limit in the numerical exploration of these configurations. For our setups the k will vary 
between one and fourteen. If the inner radius is at 1/2 of the full size of the geometry, the 
suppression of the outer edge contribution relative to the inner edge in a given sum can be 
as large as (0.5)^* ~ 10~^ for the case of maximum k, but for particles sitting at r ~ l.lri„ 
the suppression is only a factor of 14 smaller, so they should still be contributing. Of course, 
if the inner radius is very small, the near inner radius region is also very small and the 
suppression of the other regions will be more pronounced. 

There is a third definition that is useful, and that is the following 

/ 2fc+2\ 1/2 

This is another way of removing the problem with the poles, and instead of a root of the 
single trace expressions, we are taking a ratio of these expressions for different k. In the 

ratio, if the numerator gets big when one particle goes towards the locus, the denominator 
also gets large at the same time, and in the observables the average is not unduly dominated 
by large fluctuations. 

It turns out that this last definition, from the ones we have given, is optimal. We will 
explain how this works in the next section, where we can see directly in the data why this 
is so. Notice moreover, that if we take N ^ oo first, one can use large N factorization (one 
expects small corrections in inverse powers of A^), so in the limit we also have that 

RI\n-.oo = lim (-^) = hm ^ (20) 

^^"^ Jk+l f'^'^ \Jk+l) 

SO it is clear that the different definitions are not too different from each other at large N. 
However, they behave very differently at finite N. 

All of these functions are chosen because when we take A?" — > oo at g fixed, they have 
(at least in principle) a theoretically controlled approach to large that can be evaluated 
around the saddle point configuration. There are other reasons to choose inverse powers 
of the radius: in the paper of Chen et al. [43] it was shown that inverse powers of these 
matrices also appear in the description of closed string states in the presence of backgrounds 
with topology changes. Also, the moments of the eigenvalue density distribution serve as a 
characteristic description of shape that determines the half BPS states, and these can also 
be measured gravitationally in the AdS dual geometry [44] ( these particular calculations 
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use the half BPS states in the free fermion reahzation of weak couphng, and only describe 
the ten dimensional geometry indirectly by comparing to the LLM solutions [31]). 

Since in our case the density does not go all the way to the zero region = 0, the 

moments of the distribution with inverse powers of the radius can also be taken (which is 
the definition we used). These are hard to read from the asymptotic solution in gravity, but 
we are not forbidden from doing that in the field theory geometry observables. 

III. NUMERICAL RESULTS 

The probability distributions described by equation (7) are easy to simulate. The algo- 
rithm used to calculate these configurations was described in detail in [6] . It operates via a 
standard metropolis algorithm, updating one particle at a time in the Boltzmann gas. The 
initial configuration is distributed randomly on a cube. The simulation waits for relaxation 
before taking data. The only new innovation for the current project is to record the sums 
of the inverse powers of the radius in the 12 plane of the configuration, as required by the 
discretized version of equation (16). 

A. Data sets at fixed Q, N 

The easiest way to get an idea of how the distribution of positions looks is to plot the 
values of the radius in the 12 plane, ri2 in various formats. For convenience, we also plot 
the spherical radius of the other four dimensions in R^, which we call r3456. An analysis of 
one configuration shows the following distributions, in the qualitative figure 1. 

Analyzing the second figure carefully, we notice that the inner radius of the distribution 
of particles in r has a sharp edge to the left of a sharp peak, and is finished with a small 
smooth tail. This tail can probably be attributed to 1/N corrections and it might be 
possible to define the radius of the hole as the locus of highest density. This is certainly 
geometric in character, but it depends on understanding the distribution of particles in the 
ensemble better. If we were to plot instead the number of points in bins divided by the 
radius, as would be associated to a rotationally invariant measure ri2dri2, the plot does 
not change significantly. The peak is enhanced a bit, and it moves a bit to the left. A 
definition of the radius of the hole based on this feature is interesting, but it is hard to 
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FIG. 1: The distribution of radii of the configuration. The plot on the left shows that r3456 can be 
considered as a function of ri2, as expected by the singular nature of the saddle point distribution. 
It also shows that the radius of the distribution makes sense, except for one outlying point that one 
can call a statistical fluctuation. The plot on the right bins the number of particles in ri2 values 
in increments of a half. One notices a strong peak in the distribution, but it does not happen at 
the inner edge of the distribution. The sample is taken with = 1000, Q = 100, from the final 
configuration. 



understand how it would approach the large limit without extensive modeling. Since we 
do not have an analytic understanding of these distributions yet, it is very hard to use their 
prominent features in a meaningful way to obtain a reasonable definition of the geometry. 
For illustration, should we use the density in six dimensions to define the radius? Or should 
we use the projection of the density distribution to the 12 plane? The choice would depend 
on which observable we need to evaluate and how it is realized geometrically in the ten 
dimensional dual geometry. Notice also that the definitions of the radius that we use in 
most cases do not suppress the peak very much relative to the inner radius. Because there 
are so many more particles there, one can expect that the radius measurements we are doing 
at = 14 are tracking the peak in the distribution above. The distributions above were 
plotted after the choice was made of which functions to average. 

In hindsight, other definitions of the radius might have been better choices. After all, 
we actually know how the distributions were generated, so there is in principle a lot more 
information encoded in the eigenvalue distributions. In a certain sense, we are purposefully 
ignoring this information except for some qualitative understanding. The main reason to do 
this is to think of the algorithm that generated the geometry as a black box: the passage 



19 



from wave functions to geometries is not really understood, so how the details of the wave 
function appear in the data are still obscured. Given enough data, one should be able -at 
lest in principle- to reconstruct the 'theory' from the output of the black box. What we have 
chosen to do is to use simple geometric observables that describe integrals over a geometry. 
If we had a more precise understanding of the data, this could be exploited to get a picture 
of local geometric information. 

The next step in our procedure is to examine the data of the moments of the inverse 
radius of the distribution and to check how big are the fluctuations, relative to the size of 
the moments for the same setup = 10000 and Q = 100. This is the choice that we made 
before looking at the data. In particular, we are at first more interested in seeing how large 
are the fluctuations of these observables. This will give us a rough idea of how far from the 
large N limit we really are. In a certain sense, this is one of the most important pieces of 
information we can gather. If we expect some precision on the local geometry, we want basic 
geometric quantities to have low variance. If the variance is high, individual pictures of the 
data will be very fuzzy relative to the large N limit. 

The results of this analysis appear in the flgure 2. 
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FIG. 2: On the left figure, the mean value of the moments aX. N = 10000, Q = 100, for A; = 1, . . . , 14. 
On the right, the statistical variance Sfk/fk- As can be seen, the variance in the values increase 
as we take k larger, and are already of order 10% for k = 14, which is a moderate value of k. 
This indicates that fluctuations start becoming important for relatively low values of k, even in the 
case where Q is large. For smaller Q the inner radius is smaller, and fluctuations of one outlying 
particle near the inner edge are even more important in Z^. 



The flgure shows that the variance of the observables depends on the quantum numbers 
of the observables. A look of the data suggests a power law behavior. We can test this in a 
log plot, shown in flgure 3. 
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FIG. 3: A log plot of the data in figure 2 

A fit to log suggests that 5fk/fk — Ak^-^^, but looking carefully at the data in figure 3 the 
power law behavior sets in after k — 5 and for low values of k the answer is different. Since 
we do not have a theory of these fluctuations, we can not do a fit that is sufficiently informed 
to know if this behavior is typical and represents a departure from the large limit or not. 
Also, one show notice that since all of the variables we are measuring are strictly positive, 
the distribution of the variables is not gaussian. For Sfk/fk sufficiently small this should 
not matter too much. However for large k this is important and therefore the measurement 
of the fluctuations will have (large) non-gaussianities. Without a statistical model for how 
to deal with these issues, the numbers above just give order of magnitude estimates for 
deviations from typicality. 

Now, we can go to a more extreme case, where the 1/A^ corrections are much larger. We 
do this in the figure 4, which is worked for the case A^ = 1000, Q = 5. Notice that in this 
case the sizes of the different changes by various orders of magnitude at high k between 
different configurations. Hence the fiuctuations are very large and dominate any calculation. 
This is a situation where we expect that the averages {fk) do not converge for k large, as 
described in the previous section. Hence the behavior of the graph is as would be expected: 
the large values of fk are dominated by large fiuctuations rather than by the mean values. 

For the same extreme setup, we can also plot the radius as measured by the three different 
measurements that we suggested in the previous section. This appears in the figure 5. As can 
be read from the figure, we find that for various k, the definition of the radius Rq converges 
faster than the definition of the radius based on the roots of fk- We can prove this in general 
(the argument is found below). Moreover, we see that the naive definition of the inner radius 
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FIG. 4: A plot of logi{){fk) versus k for the case of = 1000, Q = 5 for 50 different configurations. 
Notice how for large k the magnitudes cover various decades. 

crosses the one defined by the fractions, even though configuration by configuration the naive 
definition gives a higher value. This is because the averaging procedure is different. Notice 
also how for low values of k the naive definition and the root definition agree for the most 
part. This is exactly what is expected when fluctuations are small. 
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FIG. 5: The radius Ro, r^'^\ r^'^^ of the hole relative to the radius of the sphere at Q = 0, evaluated 
at different values of k before taking the limit . We see that the values of Rq are lower than 
and show faster convergence, while the naive definition of the radius converges towards zero 
and is dominated by fluctuations for large k. 

Now, let us prove that the inequalities i?ofc < ^t+i- Notice that configuration by configu- 



ration we have the following equality 
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We also have the the inequahty given by 

Tk+l < Tk (22) 

This is obvious, as the r^+i is dominated more strongly by the particles near the inner 
edge than r^. It can also be proved mathematically using Holders inequality or Jensen's 
inequality. 

From here, it follows that 

\4^<rk+i (23) 

y jk+i 

And this inequality persists when we take averages. Hence the claim. Also, in the limit 
A; — > oo, both definitions would converge to the same value (before averaging the limit is 
exclusively dominated by the innermost particle), but the one based on the ratios converges 
faster (what we called Rq). Hence the definition Rq is the most numerically economic 
calculation of the radius of the configuration, as promised earlier. For completeness, we 
show now the plot in figure 5 with the variance of the data set included. 
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FIG. 6: Plot of Rqu and ' with the variance of sample included: this would indicate the error of a 
single measurement on the quantum system. It essentially accounts for the quantum fluctuations. 
This is the same data set used to generate figure 5. 

As should be recognized from the analysis above, the choice of variable to measure a 
geometric property in a model of emergent geometry with quantum fiuctuations can receive 
very large corrections due to the fiuctuations. This depends on how we average over con- 
figurations. This is determined in essence by the observable that one chooses to probe the 
system with. Although this is not surprising, the fact that fiuctuations can be rather large 
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even for large Values of (A^ = 10000 in our example) show that even though they are 
nominally suppressed by l/N^, the coefficient in front of this suppression can be very large 
depending on the 'quantum numbers' of the observable. This suggests that for many detailed 
geometric questions we might need to consider very large values of A^. For simulations on 
computers, this requires a lot of resources and the development of fast algorithms to perform 
the task efficiently. 



B. The approach to large N: fixed q simulations. 

The next step in our analysis is to check how the results converge to large values for 
objects that are comparable. In this case, we need to keep q = Q/N fixed to match the 
thermodynamic limit. We choose g = 0.1 and plot our results for various values of A^. This 
is shown in figure 7. As can be seen for the figure, as A^ gets larger, the reported value of the 
radius of the configuration increases. This is expected for the simple reason that at smaller 
A^ fiuctuations of particles near the inner radius are bigger, and the observables we have are 
sensitive to these fiuctuations. Notice also that the fiuctuations in individual measurements 
as a function of A^ decay with A^. However, from the data with the smallest value of A^ 
to the highest value of A^ we have one decade in difference, and that seems to be the same 
difference in the size of the fluctuations. 




FIG. 7: The approach to large N. We show data for Rq for different values of k, at q = 0.01 
and N = 1000, 2000, 3000, 4000, 5000, 6000, 10000 from bottom to top in the graph. The error bars 
signify the variance of the measurements. Again, this signifies the size of the quantum fluctuations. 
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We can also see that the differences between the observables calculated at different N 
that are separated by the same amount are getting smaller as we increase N. This strongly 
suggests that there is convergence to a large N limit, but graphically we see that the con- 
vergence is rather slow. 

This suggests that the deviations go as rather than iV~^. This is rather surprising. 
This result should be contrasted with the results [45], where the limit N ^ oo should be a 
classical system and we should be able to learn everything about the system expanding in 
powers oll/N"^. 

However, in a sense, the model is not a matrix model in the t' Hooft limit [42]. We 
have taken the t' Hooft coupling A = g'^N — > oo keeping N fixed and large, so that we 
can reduce our model to the set of commuting matrices. But a similar step of expanding 
about commuting matrices was done in [17] for a non-supersymmetric model. There it 
was found that the joint eigenvalue distribution obtained by taking this procedure was not 
exactly the correct size and that ignoring the loops of the variables that were integrated 
out gave the wrong answer. If we assume that something similar is happening here, where 
supersymmetry makes cancellations between integrated out modes happen, then we might 
still find problems if cancellations are not exact. This might be more pronounced at the edges 
of the distributions where small corrections have the biggest effects. At this stage, we are 
just speculating on what might be the cause for this feature of the numerical simulations. 
Another issue relates to the observations in the figure 4 which show that for large k we 
are not in the factorization regime yet for small N. Thus the idea of seeing a natural 
convergence to a standard large N expansion is suspect because is not large enough yet. 
This is a general issue that one should be very cautions about before making claims of some 
effect. Different observables converge at different rates, and the corrections depend on the 
parameters describing the observable. For example, if in our setup above corrections go like 
k^/N, and s is large, then we need to go to very large A^ before making any claims, because 
for k — 10 the result could easily overwhelm A^. Either way, for certain questions, a very 
large value of A" will be required before being able to compare to A" = oo. Considering 
also that the total number of degrees of freedom in the system before gauging is A^^ + 5A^ 
and that the 'energy of the ground state' is of order A^^ plus corrections of order A^, it is 
perhaps not surprising that the naive planar counting argument [42] fails to give the right 
approach to the large A^ limit. After all, we are taking the planar parameter to infinity first 
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(we are integrating out heavy modes and assuming they have no corrections to the truncated 
theory), and we keep N finite. Still, this is a puzzle that we can not resolve satisfactorily 
yet. Further study is required to understand this issue better. 

With our data, we can find a fit to the exponent and see what happens. We can do the 
following analysis. We can assume that there is a limit and a scaling exponent that describes 
how close we are to the limit, by the following argument 

AC 

^-.^^00 + ^ + ... (24) 

Then, if we take the following differences 

5iv-52iv = (l-2-°)-^ (25) 

we can make a fit for a and check the exponent appearing in the corrections. The data is 
not very illuminating. We find values as large as a ~ 0.7 for low values of A; up to A; = 6, and 
then it decreases towards a ~ 0.1 for /c = 13. Again, we should not give too much meaning 
to these numbers for the large values of k if the corresponding correlators are not factorized 
yet. 

Similarly, the standard-deviation fiuctuations should have some scaling pattern 

- ^ (26) 

If we plot the data sample to the best fit for /3 we get the results in figure 8. The results 
are roughly consistent with /3 ~ 1, which is the exponent that the planar counting would 
suggest, but it is unclear how this data should be used, seeing as we might not yet have 
converged to large sufficiently. 

Again, rough consistency with ^ ~ 1 is not the same as knowing that (3 — 1. A much 
larger data set from more simulations would be able to verify this hypothesis. It would also 
require going to much large values of N (hopefully stepping up to a factor of 100 in N might 
be enough). 

If the planar counting does not work, it could be because we ignored some degrees of 
freedom that contribute to the limit, and that these make the full system behave as is 
expected. However, we should expect that the corrections induced by integrating out heavy 
modes involve their masses, which are expressed as powers of the large N coupling constant 
A = g'^N [5, 18]. We do not have a clear understanding on how the planar counting would 
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FIG. 8: Scaling exponent for the variance of Rq at various values of k. 

be restored once we take A finite. It would require subtle cancellations to take place in 
order to do that. The upshot is that this is an interesting observation that the numerical 
exploration is giving us. It would be very hard to guess this behavior for observables with 
our current understanding of large N systems [45] . The one thing that is very special of this 
setup is that when we look at the eigenvalue density distribution in the saddle point limit, it 
is singular: it has a 6 function behavior. Singular distributions get smeared by fluctuations 
and because they are singular, they might induce anomalous scalings of fluctuations, etc. 
Again, a thorough understanding of these issues requires further work. 

C. A scaling relation: size of the hole for varying q at fixed 

The next item in the list is to check some of the geometric features of our ensemble 
and compare them to the expected solutions of supergravity and in particular with the free 
fermion picture [31, 40]. The idea is to fix and vary Q, thereby exploring the parameter 
q = Q/N . We do this for our largest value of = 10000 and for the largest k that we 
recorded k = 13. We plot Ro{q) versus Q in the figure 9. 

A fit of the form Ro{q) ~ q" shows that the best fit is with a ~ 0.47 ±0.1, where the 
error is an estimate of the systematic error. We use the variance in the inner radius as a 
measurement of the systematic uncertainty. This is consistent with a = 0.5. Incidentally, 
a = 0.5 is the scaling value for the free fermion droplets. The reason for this is that the free 
fermion system (quantum hall system) is equivalent to a Coulomb gas in two dimensions. 
For that system both particles and holes form incompressible fluids, and the radius of the 
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FIG. 9: Plot showing Dependence on the size of hole to the parameter Q , at = 10000 

inner hole in these systems also scales like R ~ Q^'^ where Q is the number of holes. The 
full data for all k for the same data set is shown in the figure 10. This can be used to 
estimate errors. As can be seen, for large Q the data has not converged yet, so the values 
quoted in the figure 9 are high and taking care of this more systematically has a tendency 
to reduce the exponent a. It is hard to extrapolate to large k, because we do not have an 
exact functional form to match. This requires understanding the statistics and the large N 
limit better. As seen in the previous section, this is rather subtle. 
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FIG. 10: The function Ro{Q) at various values of k, for Q = 10, 20, 30, 40, 60, 80, 100, N = 10000 

In a previous paper [6] , a different measurement was done by keeping Q fixed but varying 
A^, and a scaling exponent of 1/4 was reported. This expression involved small values of 
N, so it is very likely that the system was subject to large fluctuations. In this paper we 
have seen how this is an issue in general, and the deviations can be very large. Also, the 
convergence to large is slow, so the dependence on q = Q/N is not reliable in the paper 
[18]. The range of g = Q/N explored in that paper was similar to the one of this paper. 
Neither result is conclusive. In this paper the radius has not converged and the systematic 
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errors of extrapolation make the determination of a done here somewhat unrehable. The 
best way to reduce errors is to have larger simulations and to improve the understanding of 
the hmit — > oo. What we see is that ~ 14 is still too low. We did not know this ahead 
of the simulation and unfortunately larger values of k were not recorded. This decision 
was made ahead of the simulations: we only stored moments of distributions rather than 
full configurations. Full configurations generate very large data files in order to get large 
enough statistics. Due to various limitations, we opted for this approach. In a sense, we 
have performed this analysis as a blind analysis on the data. Consider however, that the 
information we have gathered is important from another perspective. If one sets out to do 
large simulations, the problems of data management is an issue. Our exploration gives us 
some rather precise information of how much data we need to keep and in what type of 
format. 

As a further thought, one expects that there are corrections to the scahng behavior that 
would involve some power series of q. Such corrections would require much more data to do 
a proper fit, as well as a systematic understanding of the functional form of the correction. 
If this is studied, the dominant scaling exponent (that with the smallest power of a) would 
be revealed for q small. 

It would certainly be interesting to study these puzzles further and to match some prop- 
erties to the gravity theory in observables that can be matched on both sides with better 
understanding (for example the energies of string states, as was done for the ground state 
in [18]). 

IV. CONCLUSION 

In this paper we have examined the problem of extracting geometric data from a system 
that exhibits emergent geometry in a thermodynamic limit where the number of particles N 
in a statistical mechanical system becomes large. The geometric data we computed was one 
single parameter: the radius of a feature, that depends functionally on a variable defining a 
one parameter family of wave functions of the system. 

In order to measure this geometric feature, we needed some scheme to define the shape 
parameter at finite N . We chose observables that were simple to calculate numerically 
and that have a reasonable behavior at large N as given by our theoretical expectations 
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of what constitutes a reasonably good observable. Our work has shown that extracting 
the geometric data requires making judicious choices of these observables. Otherwise, one 
might find pathological distributions whose averages are infinity entering into the problem 

when N becomes finite, even though when is very large they seem to be well behaved. 
Learning how to avoid pathological measurements requires some trial and error and wc gave 
some examples of how to improve this simple class of observables so that they had better 
behavior. 

We also studied how large were the fluctuations on the observables we chose and how fast 
the results approached the large limit. Prom the data we could analyze, the corrections to 
large N are controlled by various powers of 1/N, and some of them seem to be inconsistent 
with the naive planar counting lore. Surprisingly, the convergence to large was slower than 
expected, and the fact that some observables had pathologically large fluctuations renders 
a lot of the analysis inconclusive: even with N ~ 10000, N was not large enough to see a 
clear pattern emerging from the simulations. Prom this point of view we have learned a lot 
about the scale of simulations that would be required to sort out these features with the 
same procedures in future studies. We should have much larger simulations, with N ~ 10^. 

Part of the reason why such large numbers arc required should be obvious to anyone that 
tries to understand higher dimensional geometries of dimension d by adding a discretized 
grid approximation to the geometry. If we want to double the spatial resolution, we need a 
factor of 2*^ enhancement on the number of cells. As d becomes larger, this is an issue. A 
value of ~ 10^ is small to refine five dimensional geometries by these standards. 

Clearly, any future study should require smarter and faster algorithms than what we 
have used so far. We have used a simple Monte Carlo that runs on a single processor for 
these simulations. A better algorithm would involve parallel processing, a faster procedure of 
generating data sets and it might also require some coarse graining improvements to perform 
calculations with a study of the consequent systematic errors introduced by approximations. 

Another feature that we discovered is that it is very likely that there are better ways of 
analyzing the data than what we did. Particularly, features of the data as shown in figure 
1 suggest that that the density profile in the various variables is a rich area to explore. If 
we improve our theoretical understanding of the data this might be exploited to address 
issues of geometry further. Por example, in our analysis the fact that the geometry is a 
submanifold of was not crucial. However, it is a robust theoretical feature that should 
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be exploited to analyze the system better. 

For geometries with more features than a single hole in the center of the distribution, it 
would be interesting to figure out if one can find a fit to the wave function from numerical 
data. Such a dictionary would help us understand a lot better the relation between the geom- 
etry and the original quantum setup. Such an approach will obviously use more information 
about how the configurations were generated to infer (or reconstruct) information. 

Notice also that we are not considering either coarse grained geometries (statistics over 
multiple states that share common features) nor random statistical states in the ensemble. 
For such states, other techniques are available that show how hard it is to measure micro- 
scopic geometries [44]. Distinguishing the geometries obtained from bona- fide 'geometries' 
from those obtained from random superpositions of states should also be interesting. We 
are currently looking into these matters. 
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